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Abstract — The problem of detection and possible estimation 
of a signal generated by a dynamic system when a variable 
number of noisy measurements can be taken is here considered. 
Assuming a Markov evolution of the system (in particular, 
the pair signal-observation forms a hidden Markov model), a 
sequential procedure is proposed, wherein the detection part is 
a sequential probability ratio test (SPRT) and the estimation 
part relies upon a maximum-a-posteriori (MAP) criterion, gated 
by the detection stage (the parameter to be estimated is the 
trajectory of the state evolution of the system itself). A thorough 
analysis of the asymptotic behaviour of the test in this new 
scenario is given, and sufficient conditions for its asymptotic 
optimality are stated, i.e. for almost sure minimization of the 
stopping time and for (first-order) minimization of any moment 
of its distribution. An application to radar surveillance problems 
is also examined. 

Index Terms — Asymptotic optimality, hidden Markov models 
(HMM), sequential detection and estimation, SPRT. 



I. Introduction 

MANY statistical decision problems in engineering ap- 
plications require to perform state estimation of a 
dynamic system under uncertainty as to signal presence [1]- 
[3]. This includes fault detection and diagnosis in a dynamical 
system control [4], [5], target detection and tracking [6], 
image and speech segmentation [7], speaker identification 
and source separation, blind deconvolution of communication 
channels. Application of sequential decision rules to the above 
scenario arouses much interest since it promises a considerable 
gain in sensitivity, measured by the reduction in the average 
sample number (ASN), with respect to fixed sample size (FSS) 
procedures. These advantages are particularly attractive in 
remote radar surveillance, where the signal amplitude is weak 
compared to the background noise and stringent detection 
specifications can be met only by processing multiple frames 
as in [8]— [10]). In this case, FSS techniques usually result to be 
inefficient while sequential procedures are known to increase 
the sensitivity of power-limited systems or, alternatively, to 
reduce the ASN. 

The adoption of sequential procedures, however, poses 
some difficulties: since the instant when the procedure stops 
sampling is not determined in advance (it is a random stopping 
time, indeed) the set of trajectories of the dynamic system 
to be considered (i.e. the parameter space) has an infinite 
cardinality. On the other hand, sequential testing rules have 
been already extended to the case of composite hypotheses. In 
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1 1 1 ] an SPRT is adopted in a radar framework assuming a prior 
on the parameter space, in turn consisting of a finite number 
of elements (the radar resolution cells). Sub-optimal sequential 
classification procedures (also called multi-hypotheses tests) 
were also proposed during the past years, such as [12]— [19] 
for the case of independent and identically distributed (i.i.d.) 
observations and [20]-[24] for the more general setting of 
non i.i.d. observations. However, all of these studies were 
restricted to a finite cardinality of the parameter space, an 
overly restrictive condition, which corresponds to requiring 
that the dynamic system may only lie in a determined state, 
with no transition allowed. Few works in the past have studied 
sequential problems for hidden Markov models (HMMs), 
which are known to admit a dynamical system representation 
in the sense of control theory [25]. In [26] the performances 
of SPRTs for model estimation in parametrized HMMs and 
the cumulative sum (CUSUM) procedure for change point 
detection in HMMs are studied, while [27] addresses the 
quickest detection of transient signals represented as HMMs 
using a CUSUM-like procedure, with possible applications to 
the radar framework. 

This paper addresses the problem of sequential detection 
and trajectory estimation of the state evolution of a dynamical 
system observed through noisy measurements. In the above 
framework, its contributions can be summarized as follows. 

- At the design stage, a sequential procedure is defined 
with no restriction as to the parameter space cardinality. 
The detection part of the procedure realizes an SPRT 
while, in order to estimate the system state trajectory, 
a gated estimator is defined, in the sense that estimation 
is enabled by the result of the detection operation. 

- It is known that Wald's SPRT for testing simple hy- 
potheses based on i.i.d. observations has a number of 
remarkable properties [28], [29], the most appealing be- 
ing the fact that it simultaneously minimizes the expected 
sample size under both hypotheses. These properties, 
however, fail to hold when the observations are not i.i.d., 
as it happens when they are generated by a dynamic 
system. In this paper, a deep asymptotic analysis for the 
detection part is given and sufficient conditions under 
which these properties hold are stated, consistent with 
previous results in [20], [21], [23]. In particular, it is 
shown that under a set of rather mild conditions the test 
ends with probability one and its stopping time is almost 
surely minimized in the class of tests with the same or 
smaller error probabilities. Furthermore, reinforcing one 
of such conditions, it is also shown that any moment of 
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the stopping time distribution is first-order asymptotically 
minimized in the same class of tests. 

- At the application stage, the general problem of multi- 
frame target detection and tracking for radar surveillance 
is considered: in this way, previous limitations on target 
mobility imposed by other studies are avoided. 

- Finally, a thorough performance analysis is given, aimed 
primarily at showing the correctness of the asymptotic 
analysis and at investigating the effects of system pa- 
rameters. The superiority of sequential detection and 
estimation rules with respect to FSS techniques is also 
shown in the afore-mentioned radar application. 

The rest of the paper is organized as follows. Next sec- 
tion presents the elements of the problem while section [Til] 
addresses the sequential detection and estimation problem. 
Section [IV] presents the asymptotic results while section [V] 
covers the radar surveillance problem. Finally, section [VT] 
is devoted to the presentation of numerical results, while 
concluding remarks are given in section \VU\ For reader's sake, 
some notation, used throughout the rest of the paper, is first 
introduced. 

Notation: In what follows, all random variables are 
defined on a common probability space (f2,jF, P) and are 
denoted with capital letters. Lower case letters are used to 
denote realizations of random variables while calligraphic let- 
ters to denote sets within which random variables take values. 
(T-algebras are denoted using script letters, a(X) being the 
smallest er-algebra generated by the random variable X. Xi :J 
will be used to denote segments of random variables taken 
from the process {X k } keZ : specifically, = {X k } J k=i 

for i < j, and X_oo : j = {X k y k=QCi . E is the operator of 
expectation: a subscript will be added in case of ambiguity, 
so that Eg and Eh are expectation when 9 is the true state of 
nature and hypothesis H is true, respectively. D(-|j-) denotes 
the Kullback-Leibler divergence operator. The acronyms a.s. 
and a.e. stands for almost sure and almost everywhere. N 
denotes the set of natural numbers, i.e. {1,2, . . .}, Z the set 
of integers, R the set of real numbers and M + the set of 
positive real numbers. Finally, the notation h v ~ g v means 
that lim h v /g v = 1. 

v— >0 

II. Problem formulation 

Consider a dynamic system with a Markov evolution. Xi, 
i € N, is the state vector at time i and § is the state space, 
with cardinality M. In particular, {Xi}^^ forms a discrete- 
time, homogeneous Markov chain with given initial distri- 
bution 7r and transition probabilities a(xi,Xj) = P({X k = 
Xi}\{X k -i = Xj}), Xi,Xj £ S. A sequence of states 
{Xi}^ =1 , often called trajectory, is denoted with ICx-.k an d 
has density Pk^i-.k) = Yli=2 a ( x i-ii x i)> w i tn respect 

to the counting measure. {Xi}^ is observed through a set 
of noisy measurements. The measurement process is {Zjjjgpj, 
and the sample space of each Zi is (Z, Y), "V being a a- 
algebra of subsets of Z. Consider a a-finite measure v on 
(Z,y). If the signal {A;} ieN is present, {(A7,Z;)} ieN is a 
HMM: given a realization {xi}i 6 N of {A^}i £ N, {^i}ieN is a 
sequence of conditionally independent random variables, each 



Zi having density f(z\xi) with respect to v. On the other 
hand, if the measurements contain only noise, {Zi}i e ^ is an 
i.i.d. process, each Zi having density f(z\9o) with respect to 
v. Thus, for every k E N, the joint distribution of Zi :k has 
conditional density 



/fc(zi :fc |xi :fc ) = ^/(ZjI^), 



if the signal is present 



i=l 
k 



and Xi :fc = xijfc, 

/fc(zi:fc|0o) = \\ f{zi%), if the signal is not present, 

<=i 

with respect to v , 

Given these elements, one is to sample the process {Zi}i^ 
sequentially and decide, as soon as possible, if measurements 
are generated by noise alone or if they come from a dynamic 
system. In the latter case, it can be also required to estimate 
the system trajectory which has generated such measurements. 
The parameter space, then, is {6>o} U ( X ieN S). As in [l]-[3], 
whose focus, however, was on non-sequential decision rules, 
there is a mutual coupling of detection and estimation and two 
different strategies may be adopted. Indeed, the structure of the 
decision rule can be chosen so as to improve the detection or 
the estimation performance. The former case is called a weakly 
coupled (or uncoupled) design while the latter a strongly 
coupled (or coupled) design. In both cases, the estimator is 
enabled by the detection operation: this gating, however, can 
be (possibly) optimal for the detection or for the estimation. 
However, the problem of designing sequential procedures for 
detection and estimation is considerably more difficult than 
that of devising FSS procedures [30] and the approach taken 
in general is to extend and generalize the SPRT designing 
a practical, possibly sub-optimal, rule [11], [12], [14], [15], 
[21]-[23]. In this paper the uncoupled strategy is adopted, 
this choice being motivated by a number of reasons: it has 
a very simple structure; as shown in section [IV] it exhibits 
many optimal properties; detection is the primary interest in 
many practical applications, as for example, radar surveillance 
problems later discussed. 

III. Detection and estimation procedure 

A sequential decision rule is the pair (ip,£,), where tp = 
{</9fc}fceN is a stopping rule and £ = {£fc}fceN a terminal deci- 
sion rule [29]. Since detection and estimation are performed 
in parallel, the terminal decision rule is itself composed of 
a detection rule 5 — {5 k }k£N for testing the signal presence 
and of a trajectory estimator x = {x k }k£N, i-e. £ = (S, x). The 
proposed (non-randomized) sequential decision rule is, then, 




1, if A fc (z 1:fe ) £ (70,71)) 

0, otherwise, 

1, if A fc (zi :fe ) > 71, 
0, if Afc(zi; fc ) < 70, 



(la) 



(lb) 



Xk(zv.k) =argmaxpk(xi : fc)Afc(zi:fc|xi : k) ) if A fc (zi :fc ) > 71, 

x 1:fc es fc 

(lc) 
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where A fe (z 1:fe ) = Lx 1:fc es* Pfc( x l:fc) A fc( z i:fc| x i:fe)> 
A fe (zi :fe |xi :fc ) being the likelihood ratio of / fe (zi :fc |xi :fe ) 
to f k (z 1:k \9 ). 

Notice that the pair (ip, S) is an SPRT for testing Hq = 
'noise only' against the alternative Hi = 'signal present', no 
matter of its trajectory. Hi, then, is the hypothesis that Zi- k 

has density f k ,H^i:k) = E Xl:fc es fc Pfc( x i:fc)/fc( z i:fel x i:fc)> 

V k £ N. The strength of such a sequential test is the pair 
of probabilities of errors of the first and second kind, a and 
(3, respectively (often, in detection problems, a is referred 
to as probability of false alarm, Pf a , and f3 as probability 
of miss, -P m i ss ). Denoting with r the stopping time and with 
"0 — {"0fc}feeN its conditional distribution^ these probabilities 
of error are given by a = Xfc G N E eo [^fe(Zi : fe)4(Zi :fe )] 
and (3 = E fceN E ffl ty*(Zi :fc )(l - 4(Z 1:fe ))j, - 
Z^eN^i [^fc( z i:fe)^fe(Zi:fc)] being the probability of de- 
tection^ The boundaries of the test, 70 and 71, with < 70 < 
1 < 71 < +00, are chosen in order to have the required 
strength (a, (3). 

As concerns x only, it can be considered a gated estimator 
since estimation is enabled by the detection rule. Furthermore, 
consider the triplet (r, Xi :T , Zi :r ). Since 

P({t = k, X 1:r = xi-fc, Z 1:T £ A k }\{r < +00, 
6 T (Z 1:T )=1,H 1 }) = 

= P({r = k, X 1:r = xi ;fc , Zi ;T £ A k , S T (Z 1:r ) = l}\Hi) = 

P({T<+00,* r (Zi :r )=l}|F 1 ) 
= ^cT 1 / P( x l:fc)/fe( z l:fcl x l:fc)^fe( z l:*)4(zi:fc)^ fc (zi:fc)) 

V fc e N, xi :fc £ § k and A k £ a(Zi :k ), 
P d " 1 p( x i:fc)/(zi:fe|xi :fc )V'fc(zi : fc)4(zi:fc) is the density of 
(r, Xi iT} Zi :T ) I {accept Hi, Hi true}. This means that 
^cT 1 LfceNP( x i:fc)/( z i:fe|xi : fc)-0fc(zi : fe)(5fc(zi : fc) is the density 
of (Xi :T , Zi :r )|{accept Hi, Hi true} so that 

arg maxy^p(xi :fc )/(Zi : fc|xi : fc)V)fc(Zi :fc )(5fc(Zi;fe) 

is a MAP estimator conditioned upon the event 
{accept Hi, Hi true}, xi- k being the projection of 9 on 
§ fe . Since the above estimation rule is exactly that in ([Tel l, it 
results that a; is a MAP estimation rule conditioned that no 
error of the first kind is made by the detector. 

Finally, as to the computational complexity, the 
sequential decision rule in (UJ requires to evaluate 
the statistics E Xl:fc eS fc P*t( x i:fc) A fc( z l:*l x i:k) and 
max Xl:fc6S fc Pfc(xi:fc)A fc (Zi :fe |xi :fc ) for k = 1,---,t, where 
p fc (x 1:fc )A fc (Zi :fc ) = 7r(a;i) ^|^ Ui=2 0(^-1, J») /(^jgoj • 
These statistics, then, have the form of a stage-separated 
function on the algebraic system (M, +, •) and (M, max, •), 
respectively, and can be computed through two dynamic 
programming algorithms [31] (similar to the forward- 
backward procedure and to the Viterbi algorithm as in 
[32]), whereby the computational complexity is only linear 

ipk( z l:k) i s the probability that r = k given a realization z 1;fc of Z 1:fc , 
for any k S N; the relationship between tp and ip is: ipi (,z%) = ■pi (z\ ) and 

i>k( z l:k) = V( z l:k) IllZl i 1 - ¥>( z l:«)). for k > 1. 

2 Notice that /? + P d = P({t < +oo}|i?i). 



in k. Notice that maximization in ( [Tel l is preferred to 
argmax Xi:fc6S fc Pfc(xi:fc)/fc(zi:fc|x 1:A .) since, in the former 
case, the estimator can work on the same data as the detector, 
thus further lowering the complexity. 

IV. Asymptotic analysis 

Let £r( a ',0'), a', (3' £ (0,1), be the class of non- 
randomized tests (denoted with (N, d), N being the stopping 
time and d the terminal decision rule), either sequential or FSS, 
with probability of error of the first and second kind bounded 
by a' and j3', respectively. It is known that Wald's SPRT for 
testing a simple hypothesis against a simple alternative based 
on i.i.d. observations has the following remarkable properties 
[28], [29], [33], [34]. 

i) If the test has strength (a, (3) and boundaries 70, 71, 
then 



ol< (l-(3)/ji < I/71 and < (1 



a) Jo < 7o- 
(2) 



ii) The test ends a.s. under both hypotheses. 

iii) The ASN is finite under both hypotheses. 

iv) The ASN is minimized among tests in the class 
$*(a',(3') under both hypotheses. 

Except property (01, which is easily shown to hold under very 
general conditions [35], the other properties in general do not 
hold in the present setting since the observations {Zi}i & ^ 
are not independent. This section is devoted to studying the 
asymptotic behaviour of the sequential test when the two 
error probabilities simultaneously approach zero. It will be 
demonstrated that, under rather mild hypotheses, the procedure 
satisfies also properties djiji and dTTTT > and, asymptotically, drvb . 
In particular, it will be shown that every finite moment of the 
stopping time is first-order asymptotically minimized in the 
class 37~(a' ,(3'). The regularity conditions are stated below. 

Condition 4.1: The Markov chain {X k }k£t'S is stationary, 
irreducible and aperiodic. 

Condition 4.2: The family of mixtures of at most M ele- 
ments of {/(-|a;)} xe § is not equal to /(-|#o) f-a.e., i.e., for 
every distribution c on §, X^eS c ( x )f('\ x ) and f('\@o) are 
equal v-a.e.. 

Condition 4.3: For every x £ §, Ej^ In 
+00, i = 0, 1, 



f{Zi\x) 



f(Zi\e ) 



< 



Condition 4.4: There exists 

f(Zi\x^ N " 



( f(Zi\x) V 

\f(Zi\e )J 



EiJ " [\f(Zi\9 : 
all x, y £ §. 



and E 



a constant a 

( f{Zl\x) \ ±a 



> such that 
are finite, for 



Condition 4.5: ^ f[z\el) ) ^ *~* ^ or everv x E § and 



z £ 



Z. 

Condition 4.6: The matrix containing the transition proba- 
bilities {a(x, y)}x,yes is invertible. 

Remark 4.7: Since the Markov chain {AT^gN is homoge- 
neous and has a finite state space, condition 14. 1 1 corresponds 
to requiring that {Xj}i E N be stationary and ergodic, which 
will be seen to imply {Zi}^] to be stationary and ergodic 
as well, an essential property for the limiting theorem to 
be presented. As concerns condition 14.21 it can be shown 
(recursively) that it ensures the two densities /fc(-|#o) an d 
fk,Hi(-) are not ^-a.e. equal for every k £ N: otherwise, for 
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some k £ N it could not be possible to discriminate between 
statistical populations drawn from these two distributions, 
i.e. detection could not be possible. Finally, 14.31 - 14.61 are 
essentially 'regularity' conditions which allow to derive the 
limiting behaviour of the log-likelihood ratios {lnAfc}fc 6 N. 
Notice, furthermore, that the moment conditions 14.41 imply 
14.31 since there always exists a finite constant C such that 
|lniy| < 2Ccosh(alnu)) = C (w a + w~ a ), for any w > 0. 

It turns out that the validity of properties <fTT]> - dlv]) is highly 
influenced by the limits A; = lim k~ 1 F<H i [lnAfc(Zi : fc)l, 

k — >+oo 

i = 0, 1, in the case that they exist finite and non-zero [20]. 
For this reason, it is first given the following (proof is given 
in the Appendix). 

Theorem 4.8: If conditions 14. 11 - 1431 are fulfilled, there exist 
finite constants Ao < and Ai > such that 

lim -E Hi [lnAfc(Z 1:fc )l = A,;, 

k— >+oo K 

lim — In Afc(Zi : fc) = Aj, a.s. under Hi, i = 0, 1. 

These conclusions hold for any initial probability with strictly 
positive entries (i.e. not necessarily the stationary one) used 
in the definition of {Afc}fc S N and A^ have the same value. 
If the log-likelihood ratios {In A-ki^i-.k^keN satisfy this sort 
of 'stability' property, it can be easily demonstrated the 
following 

Proposition 4.9: Under conditions 14. 1 1 - |4~3l the test ends 
a.s. under both hypotheses, i.e. P({r < +oo}\H^J = 1, i = 
1,0. 

Proof: Define the two auxiliary stopping times tq = 
infjfc S N : A fc (Z 1:fc ) < 7o } and t x = 
infjfc e N : A fc (Z 1:fc ) > 71}. From theorem |4~8l 
lirrifc^+oo fc _1 In Afe(Zi : fc) = Ai, a.s. under Hi, i = 0, 1, with 
Ao < and Ai > 0. This implies that 

p({ fc Uin o A fc (Z 1:& )<7o}|Fo) =1, 
p({ fc Uin o A fe (Z 1:fe )>7 1 }|JJ 1 ) =1, 

which means that P({n < +00}^) = 1, i = 0,1. The 
thesis, then, follows form the fact that t — min{ro,ri}. □ 
The 'stability' of {In Afe(Zx : fc)}fegfsj of theorem l4~8l is sufficient 
also to imply the asymptotic optimality of the test in the sense 
of the following 

Theorem 4.10: Suppose that conditions 14.11 - 14.31 are ful- 
filled, 70 and 71 are chosen so that the test belong to £T{ol ', /?') 
and In 71 ~ In ^7, In 70 ~ In /?' as a' + /?' — > 0. Then 

1 



I In/?' I a'+/3'^0 |A | 

r 1 



— — a.s. under H Q , 
a.s. under H±. 



I In a' I q'+/3'—o Ai ' 
Furthermore, for every e £ (0, 1), it results 



inf P({N > er}\Hi 

{N,d)esr{ a ',i3') 



-> 1, 



0,1. 



(3) 

Proof: Under conditions 14.11 - 14.31 the conclusions of 
theorem 14.81 hold and then theorem 1 of [20] can be used. □ 



Notice that equation (f2]i allows to choose the appropriate 
thresholds: indeed, it implies that 71 = 1/a' and 70 = /3' re- 
sult in a test which belongs to 2?(a\ /?'). Moreover, regarding 
(O as approximate equalities (i.e. by neglecting overshoots) 
leads to the approximations (see [33]) 



71 w (l-/3)/Q 1/a, 70 ~ £/(l-a) P- (4) 

Notice that <(3j does not imply asymptotic optimality of the test 
(the optimality criterion of the Wald-Wolfwitz theorem [34] is 
about the minimization of the ASN under both hypotheses). 
With the a.s. convergence of {fc _1 In Afc(Zx : fe)}fcgN alone, the 
following can be proved. 

Theorem 4.11: If conditions 14. 1 1 - 14~31 are satisfied then 



lim inf 



E 



> 



1 



and 



lim inf 



E 



Hi 



In a' 



> 



1 



|ln/?r - |A |- 

for every positive constant r. 

Proof: As in [21, theorem 2.2], it is sufficient to apply 
the Markov's inequality. Indeed, it results 



E 



t|A | 



> e r P 



AM 



>e\H 



ln/3'|y J " VI I In I 1 "J J a>+/3>- 

for any r > and e G (0, 1), where the limit follows 
from theorem [4. 101 Since e is an arbitrary constant in (0, 1), 



|ln/3'| 



> 1, which 



it follows that liminf Q / +/ 3'^o Eff, 

proves the first inequality. The other one can be proved 
similarly. □ 
In order to guarantee finiteness of the expected sample size and 
to obtain its first-order asymptotic minimization, condition |43j 
must be strengthened requiring 14.41 - 14.61 to hold. Indeed, the 
following can be proved (proof is given in the Appendix). 

Theorem 4.12: Suppose that conditions gj] @72l 031 - WM 
are fulfilled, 70 and 71 are chosen so that the test belongs to 
&(a',/3') and In 71 - In A, In 70 - In/?' as a' + (3' 



< +00, z = 0,1, and. 



0. 

as 



E* [r r ] 



E 



Hi I 



iA r : 

jlnaT 



Then, for every r € N, E# t [r r 

a' + /?'^0, 

inf V Ha [N r }~ 
{N,d)e3r{ a >,i3') 

inf E Hl [N r }~ 

{N,d)ES{a>,f3') 

The &(\) term is due to the overshoot In (A r (Zi :T )/ 7 o) or 
In (A T (Zi :T )/ 7 i), analogous to Wald's lower bound [33] for 
the sample size (which is attained when these overshoots are 
ignored). 

From theorems 14.101 and 14.121 it results that the asymptotic 
behaviour of the test is determined by the constants A^, 
i = 0,1. These constants are often difficult to evaluate and, 
thus, approximations or bounds can be useful. To this end, 
the following propositions, whose proofs are given in the 
Appendix, are presented. 

Proposition 4.13: Constants Aj, i = 0, 1, satisfy the follow- 
ing 



x<£S 



\\ \ <*£w(x)D(f(.\e )\\f(-\x)), 



(5a) 



(5b) 



x<£S 
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where ir is the unique stationary distribution of the Markov 
chain {Xi} i( z®. 

Proposition 4.14: If for every distribution c on § 

D(E c(x)f(-\x)\\f(-\e )) = D(E c'(x)f(-\x)\\f(-\9 Q )), 

(6a) 

D(/(-|^o)|| E c(x)f(.\xj) =d(/(.|<? )|| E c'(x)f(-\x)), 

(6b) 

for any permutation c' of c, then 

dU E f{-\y)\\f{-\8»)) < At < D(f(-\x) || /(.|0 O )), 
v yes ' 

(7a) 

£>(/(-l*o)||& E A(-|y)) < |Ao| <D(f(.\o ) || /ON)), 

(7b) 

where D(f(-\x) \\ f(-\0 o )) and £>(/(-|0 o ) || f(-\x)) assume 
the same value for any x E §. 

Notice that the upper bounds in d7bl are attained if 7r(x) = 1 
for some x G § and a(:r, a;) = 1, Vi e S, while the lower 
bounds if ir(x) = 1/M, Vi S S, and a(x,y) = 1/M, Vx,y E 
8. 

V. Example of application: the radar case 

The radar problem is characterized by the inherent presence 
of multiple-resolution elements, which correspond to range 
'bins' as well as Doppler, azimuth and elevation cells. This 
problem has been solved in [11], [16], [24] but all of these 
approaches concern the case that the target is not allowed 
to change its position while being illuminated by the radar. 
This condition may be too restrictive, especially in airborne 
applications where the relative radial velocity between target 
and radar may exceed Mach-2. The surveillance area is divided 
into smaller angular regions, each visited in turn by the antenna 
beam in cyclic manner. In each region a sequential procedure 
is used to accept or reject the hypothesis that a single target 
is present. The measurement process is obtained dividing the 
region into a grid and discretizing the continuous-time received 
signal accordingly (if the grid is sufficiently fine, losses due to 
possible mismatches may be neglected). The measurement at 
epoch i e N, also called frame, is the set of returns received 
from all of the radar resolution elements, i.e. Zg — {Zg(x) : 
x E {1, . . . , N a } x {1, . . . , N e } x {1, . . . , N r } x {1, . . . , N d }}, 
where N a ,N e ,N r , N d are the number of resolution elements 
in azimuth, elevation, range and Doppler, respectively. The 
target signature appears on at most one resolution element in 
each frame. The target state space consists of the set of all 
the resolution cells, i.e. S = {1, . . . , N a } x {1, . . . , N E } x 
{1, . . . , N r } x {1, . . . , N d }, with M = N a N e N r N d . If also 
velocities are to be considered, then the state space can be 
enlarged consequently. A first-order Gaussian-Markov random 
walk model is used to derive the transition probabilities, which 
are given by a(x l ,x l+1 ) = at(x iit , x i+lt i), where x iyl 
denotes the £-th component of the target state vector at epoch 



i and 

a t (Xix,x i+ i } i) =Q\^ ) + 

\ at J 

In the above equation, Q(x) — ^= dt and ag 

is a parameter related to the target mobility along the €-th 
dimension: large values of ag allow large target maneuvers but 
decrease, at the same time, target detection and estimation ca- 
pabilities0 As concerns the initial probability, if no other prior 
information is available (for example previous detections), it 
is reasonable to force n(x) = 1/M, for all x E §. 

It is supposed that the components of the measurement Zi 
are independent, each Zi{x), x E S, being an exponentially 
distributed random variable with density 

hi(v) = u ( u )i if tne target is present in location x, 

1 + p 

(8a) 

ho(v) =e ~ v u(v), otherwise, (8b) 

where p denotes the signal-to-noise ratio (SNR) and 
u(y) is the unit-step function. In this case f(zi\xi) = 
hi(zi(xi))H^i ho(zi(x)), f(zi\9 ) = U xe sh (zi(x)) 
and 

Mzi:tH n^-n 1+P • 

(9) 

Notice that, as it can be easily checked, this model satisfies 
conditions 14.21 - 14.61 and, also, equations (0. In particular, 
for condition 14.41 to hold, it is sufficient to choose a < 1/p. 
This means that the test first-order asymptotically minimizes 
any positive moment of the stopping time distribution. Even in 
the above situation, however, occasionally long observations 
can be needed. Furthermore, if there are mismatches between 
design and actual values of some parameters (for example, 
the SNR) the resulting ASN can be very large, especially for 
small error probabilities. Truncation of the procedure then can 
be used to prevent such a problem: when a fixed sample K is 
reached, hypothesis Hi or Hq is accepted whether A^-(Zi : / c ) 
exceeds jk or not, respectively. The impact of truncation on 
the system performances as well as the problem of the final 
threshold setting is not explored further here and the reader is 
referred to the past literature [37]-[39]. 

VI. Numerical results 

The behaviour of the sequential procedure has been tested 
through Monte Carlo simulations in terms of ASN and P t rack, 

3 Notice that, even if all transitions are theoretically admissible, real targets 
necessarily need to satisfy physical constraints, such as limitations on the 
maximum velocity and acceleration. In this case, a truncated Gaussian density 
can be used. 

4 The model of equations [8] - [9] applies, for examples, if measurements 
come from a square law envelope detector, the noise is additive, white and 
Gaussian, the target has a Swerling-I fluctuation model and frequency agility 
is used to achieve frame-to-frame target amplitude decorrelation. This is a 
common situation in radar scenarios [36]. 
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p=0dB, a'=10 , P'=1(T 



M=10, a'=10 , P'=1<T 





Fig. 1. ASN under both hypotheses versus the mobility parameter a for 
p = dB and different values of M. The markers denote the values resulting 
from the asymptotic approximations of the ASN of theorem 14.121 



the probability of correct track estimation. First, a general 
problem of detection and trajectory estimation is considered 
in order to reinforce the discussion in sections [Til] and [IV] 
For simplicity, the measurement model is that of equations 
(O - (O, even if there is no explicit reference to a radar 
scenario. The state space is S = {1, . . . , M} and the transition 
probabilities are derived from a truncated Gaussian distribution 
with standard deviation u using a quantization step of 10~ 4 . 
The boundaries 70 and 71 have been set using equation (0]), 
where the design error probabilities a' = (3' = 1CP 3 have 
been adopted. In figure Q] the ASN under both Hq and H\ 
is plotted versus a for p = dB and for various M. Since 
the model satisfies conditions 14.11 - 14.61 the approximations 
for the ASN of theorem [4.121 hold: the difference between the 
approximations and the true ASN is due to the excesses of 
lnA T (Zi :r ) over boundaries. Furthermore, equations (O are 
fulfilled so that the bounds on Ao and Ai of proposition 14. 141 
hold and the extrema of the ASN are (asymptotically) reached 
for a(x, x) = 1 V x e S and a(x,y) = 1/M V x,y 6 § 
(c = 1CP 1 and a = 10 4 given the adopted quantization step). 
It is confirmed then the intuitive idea that more compact priors 
allows easier detections. Figure [2] shows the effect of the SNR 
on the probability of correct track estimation. It has been also 
plotted Ptrack±i, the probability that the distance between each 
state of the recovered trajectory and the actual state is less then 
or equal to 1 . Notice that, since an uncoupled design has been 
adopted, the estimation performances decrease as p is lowered 
and/or a is increased while a and (3 are not influenced by these 
parameters (indeed lower values of p and/or larger values of 
<7 are traded for larger ASNs). 

The remaining curves concern more specifically the radar 
scenario outlined in section [Vl The search zone is composed of 
a single elevation and 4 azimuth sectors; the other parameters 
are N r = 100 and Nd = 16. The transition probabilities 
along the third dimension (i.e. range) are defined as above, 
with a maximum admissible range transition of ±3 bins. 
Azimuth transitions are neglected while Doppler ones are 
assumed equally likely (i.e. 02 = and 04 = +00). The 



Fig. 2. Probabilities of correct track estimation P trac k and Ptrack±i versus 
the mobility parameter a for different values of p and for M = 11. 



a =0.8, p'=12dB 




Fig. 3. ASN and coefficient of variation versus the SNR under Hi. The 
dotted line represent the sample size of the equivalent FSS rule. 



truncation stage is K = 20, with jk = ^loli, while the 
nominal SNR has been set at p' = Tvjfl (notice that, since 
it is not realistic to assume prior knowledge of the target 
strength, the actual SNR is in general different from the design 
value p'). The subsequent plots are aimed both at assessing 
the effect of a mismatch between p and p' and at giving 
a comparison with an equivalent FSS procedure exhibiting 
the same Pf a . In figure [3] the ASN and the coefficient of 
variation (CV) of the sample siz^| are represented versus p 
under H\. Notice the characteristic peak at intermediate values 
of the SNR: yet the effect of the beam antenna remaining 
blocked monitoring a particular direction has been avoided 
by truncation. As for the FSS rule, a conservative choice 
has been made taking 4 samples, which is uniformly larger 
than the ASN of the truncated sequential procedure. Figure @] 
shows Pd and Ptrack versus the SNR for both the sequential 
and the FSS procedure. It can be seen that the sequential 
procedure achieve larger P& over all the inspected range of 
SNRs maintaining, at the same time, a full sample size saving. 
Notice also the massive gain granted in terms of P trac k which is 
mainly due to the low ASN required. Finally, figure [5] shows 
the performances in terms of the target mobility. Pi, p ra ck, 

5 If each frame results from processing pulse trains of pulses, p' = N$ 
corresponds to an SNR per pulse of dB. 

6 The coefficient of variation of a random variable is the ratio a/p of its 
standard deviation a and its mean p 7- 0. 
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a =0.8, p'=12dB 




p'=p=12dB. 



Fig. 4. Probability of detection and correct trajectory estimation versus the 
SNR for both the sequential and the FSS procedures. Vertical axis in Gaussian 
scale. 




■Ftrackrti an d the ASN are represented versus 173 for p = p' 
(recall that a = 10 _1 corresponds to the case of a steady 
target). It can be seen, that, while the probability of detection 
of the FSS procedure impairs as the target mobility increases, 
that of the sequential rule remains almost unchanged, in that 
large values of 03 are counterbalanced by higher ASNs. As 
to the estimation performance, it obviously decreases in both 
cases, but sequential techniques retain their superiority over 
all the range of 3 , 

VII. Conclusions 

The general problem of sequential detection and possible 
trajectory estimation of a dynamic system observed through 
a set of noisy measurements has been considered. Previous 
limitation on the system dynamics imposed by other works 
present in the literature have been removed and a thorough 
analysis of the asymptotic behaviour of the test has been 
presented. In particular, it has been shown that, under rather 
mild conditions, the test is asymptotically optimal, in the 
sense that it minimizes, up to a 0(1) term, any moment of 
the stopping time distribution under both hypothesis as the 
probabilities of error approach zero. Possible applications to 
radar surveillance problems have been inspected. Finally, the 
numerical has confirmed the correctness of the given approx- 
imations and has demonstrated the merits of the proposed 
strategy with respect to other competitors in the context of 
radar surveillance. 

Appendix 

In the following, the proofs of theorems 14.81 14.121 and 
propositions 14.131 14.141 are given. 

Proof of theorem \4-8\ Part of the proof borrows its 
arguments from [40]. Since condition |4.1| is satisfied, {A^jie^ 
is stationary and ergodic and, from [40, lemma 1], {Zi}i^ is 
stationary and ergodic as well. Given the one-sided stationary 
process {(Xi, Zi)} , it is extended to a two-sided stationary 
process {(Xi, Zi)} ieZ in the usual way. 



Fig. 5. Probabilities of detection and trajectory estimation versus the range 
target mobility parameter for both sequential and FSS procedures (top); ASN 
in the lower plot. 



Define 

q k (z 1:k \x) 



f(zi\x) 



J(Z2\X2) 



i=3 



x k es 

f(Zj\Xi) 

f(zi\e )- 



<7fc(zi:/c) =max<ft.(zi;fc|a;), 

xGS 

with q±(zi\x) — f^\eo) • Since q k ( z i:k\ x ) is the likelihood 
ratio Afe(zi : fc) given that X\ = x, it results that, V zi :k S Z k 
and k E N, A fe (zi :fe ) < q k (z 1:k ) and 



A fe (z 1:fc ) = ir(x)q k (z 1:k \x) > max {ir(x)q k (z 1:k \x)} 



> 



> max 



\ mm{TT(y)}q k (z 1:k \x)\ = min {Tr(x)}q k (z 1:k ). 



Combining the above inequalities on k k (z\ :k ) it results that 

- In mm {7r(xi)} < - In — < 0, 

k xiSS k q k (z\; k ) 

Vzi :fc e Z k and fceN, which implies that 

lim ilnAfc(Zi.fc) = lim ~ lng fc (Zi : fc),a.s. under Hi, 

lim yE Hi [lnA fc (Z 1:fc )] = lim - E Hi [lng fc (Z 1:fe )] , 

for i = 0, 1. As a consequence, it is sufficient to demonstrate 
the conclusions of the theorem for q k (Zi :k ). The advantage of 
working with q k rather than A^ is two-fold. First, q k does not 
depend upon the initial probability it and, then, the second 
part of the theorem is demonstrated. The second advantage 
descends from the following relationship 

q s+t (zi :s+t ) < q s (zi. s )q t (z s+1:s+t ), V S,t > 1, (A.l) 
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for any sequence {zi}i S N (the proof is identical to that of [40, 
lemma 3]). Define now a doubly indexed sequence of random 
variables {W st }t> s >o by W st = lng t _ s (Z s+1:t ). With this 
definition, the stochastic process {W s t}t>s>o satisfies the 
following three properties. 

1) From equation jA.lt . W s t < W su + W u t, V s < u < t, 
i.e. it is a subadditive process. 

2) By the stationarity of {Z k } keZ , {W st } t>s > is station- 
ary relative to the shift transformation W s t W s +i t+i, 
i.e. W st and W s+ i t +i have the same distribution. 

3) By condition 14.31 for i = 0,1, it results 



= E 



H, 



<E E ^ 

xes 



max 

xeS 



111 



In 



f(Zi\x) 

f(Zi\e ) 

f{Zi\x 



f(Zi\e a ) 



< 



< +oo. 



where v + = max{0, v}, v G R. 
By the Kingman's subadditive ergodic theorem [41, theorems 
1.5 and 1.8], a stochastic process satisfying these properties 
also satisfies the conclusions of the ergodic theorem, i.e. 

i) limfc^ +00 k~ 1 Wok — W < +oo exists almost surely; 

ii) E[W] = lim fe ^ +00 fc- 1 E[W 0fc ]; 

iii) W is degenerate if the process is ergodic. 

Thus, an application to Wok — ha<j'fc(Zi : fc) gives (ergodicity 
carries over from {Z k } k ^) 

lim - E ffs [\nq k (Z 1:k )] = A, < +oo, i = 0, 1, 



k— >+oo A; 
exists and 

1 



lim - lng fe (Zi :fe ) = A, : 



a.s. under Hi, i = 0,1. 



Since hr 1 In Afe(Zi : fc) and A;^ 1 lng , fe(Zi : / s ) have the same 
limiting behaviour, the proof of the theorem is complete if 
one demonstrates that Ao is finite, strictly negative and that 
Ai is strictly positive. 

In order to prove that Ao is bounded also from below, first 
notice that Afc(zi : fc) is a convex combination of the terms 
/fc(zi :fe |xi : fc)//fc(zi :fc |6>o), for any z 1:fe G Z k and k £ N, and, 
thus, 



lnAfe(zi : fc) >ln min 

xi :fc eS' 



= > min In 

n—l 



/fc(Zl :fc |Xl;fc) 

fkZl:k\0o) 

f(Zn\x) 
f(Zn\0 O , 



k 

E 



where {rj(Z k )} ke ^ forms a sequence of i.i.d. random variables 
under Hq. Furthermore, by condition 14.31 it results that 



. , f{Zi\x) 
mm in — ; ; — - 

xes f(Zi\6 ) 



< +oo. 



E ffo [|r,(Zi)|] = E Ho 



This implies that 

A = lim -E Ho [lnA fc (Z 1:fc )] > E Ho ^(Z,)] > -oo. 

As for the sign of Ai, i — 0,1, let g n (Zn\ z i:n-i) 
denote the conditional density given by the ratio 



/n,ffi(zi:n)//n-l,iJi(sBl:n-i) for ?i > 2 and by fi tHl {zi) for 
n = 1. With this notation, the limiting constants Ai are also 
given by 

A, = lim yE Hi [lnA fc (Z 1:fc )] = 



1 

= lim - } ~E Hi 

k-y+oc k ^— ' 



n=l 
k 



lim - V" E ffi 
fe^+oo k ' 

n— 1 



hi 



hi 



,9n(^n|Zi:„_i) 

3n(^l|Z-n+2:0 
/(Zi|0o) 



0,1, 
(A.2) 



where stationarity of {Zj}j 6 z has been exploited. On 
the other hand, g n (Zi |Z_ n+ 2 : o) can be written as 
Exesf(Zi\x)P({Xi = x}\Z_ n+2:0 ), V n G N, and this 
implies that 

. /(^iN) . g„(Zi|Z_ n+2:0 ) /(Zi|ar) 
mm — ; < , „ ; < max — ; , n£N, 



xes /(Zi|0 o ) 



and, by condition 14.31 that |ln 



xes f(Zi\e y 

g„(Zi|Z-„ +2:0 ) 1 

J nf 



/(Zi|0 o ) 

quence of uniformly integrable random variables. In this case, 
dominated convergence gives 



lim E/^ 

n — >+oo 1 



hi 



<?n(^l|Z_ 



-n+2:Q) 



f(Zi\e ) 



In 



g(Zi|Z_ oo:0 ) 
/(^i|0 O ) 



(A.3) 



0, 1. In the above equation, g(Zi\Z- oo: o) denotes the limit 

lim g„(Zi|Z_„ +2: o) = 

n — *+oo 



= lim J2f( Z l\ x ) P ({ X i= x }\ Z 

n— >+oo * — • 



-n+2:0 I 



xeS 



xeS 



f(Z 1 \x)P({X 1 =x}\Z_ oo:0 ), 



(A.4) 



where the latter equality follows form a martingale conver- 
gence theorem by Levy (see [42]). Finally, from dA.3b and 
from the Cesaro mean theorem follow that A^ in equation ( IA.2I ) 
can be also written as 



Ai =E 



Hi 



hi 



g(Zl\Z-oo: ) 



i = 0,1. 



f(Zi\6 ) 
which implies that 

A 1= E ffl [D(.g(.|Z_ oo:O )||/(-|0o))] >0, 
Ao = - E Ho [D (/(•|0 o )|| 5 (-|Z_ oo: o))] < 



(A.5a) 
(A.5b) 



Inequalities in jA.5i descend from the fact that the Kullback- 
Leibler divergence is always non negative and is equal to zero 
if and only if the two densities are equal v-a.e.: this, however, 
cannot happen since, from equation ( lA.4t . (/( • |{Z„oo : o = 
z-cc:o}) is always a mixture of M elements of {f(-\x)} xe §, 
which is not i^-a.e. equal to f(-\0o) by condition 14.21 □ 
Proof of theorem \4.12\ 
Exploiting the idea introduced in [26], [43], the likelihood 
ratio can be equivalently represented as Afc(Zi : fe) = |LMfe7r||, 
k e N, where || • || is the Li-norm on R M and Mk is a M x M 
matrix on R defined as follows 



Mi = Ii, Mk = XkA'Mk-i, for k > 2, 
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(•) T denoting transpose, T fe being a diagonal M x M matrix 
with entries {f(Z k \x)/ f(Z k \6 )} xe §, A = [a(x, y)] x , v es 
the transition probability matrix and 7r = [^(x^gs the 
initial probability vector. Under conditions 14.51 and 14.61 Mk is 
invertible for every zi : & G Z k , k £ N, while, under hypothesis 
Hi, {Xk, Zk}kefi is a Markov chain on Sx Z. This implies that 
the process { (Xk, Zk),Mk} keN is a multiplicative Markovian 
Process (see [44, definition 1.1]) and {Mk} keN a product of 
Markov random matrices. 

In order to exploit the large deviations result for products 
of Markov random matrices in [44], it is first needed to verify 
the validity of conditions A of [26] and [44]. Under condition 
14.11 {Xfc}fe 6 N is uniformly ergodic and so is the Markov chain 



{{Xk, Zk)} k( - N (see [45]), i.e. condition Ai is fulfilled. As 
concerns A2, it is requested that there exists p > such that 



E 



jp{ln||M fc |Lln||M 1 II } i „ „ 
1 I 11= II ll= fc II J X.I = Xl, Zl = Zl 



< 



(A.6) 

\/(xi,Zi) G § x Z and k = 0, 1, where the expectation is 
taken with respect to the joint distribution of {X k , Z k } kG ^ 
and the matrix norm is that induced by the vector norm, i.e. 
||Mfc|| = sup||„|| =1 ||Mfcw||. Now it results that 



HI^IMI^" 1 !} < \\T 2 A T Ti\\ p + \\(T 2 A T Ti) 



< 



< 



\Y + (\M 



To 



with 



m\ 




( f{Zk\x ) 



( f(Z k \x) 

\f(z k \e ) 



i = ±1. This implies that equation (IA.6b is satisfied if 

\Xi = xi,Zi = zi 



E 



( f(Zi\w) f(Z 2 \x) 

\f(Zx\e ) f(z 2 \e ) 



< +oo, 



y-w,x,xi G § and zi G Z. But 

±p 



E 



( f(Zi\w) f{Z 2 \x) 

\f(Zi\e ) f(z 2 \e ] 

f(zi\w) x 



f(zi\e ) 

f(zi\w) 



E 

yes 



yes 



1*1 
a(xi,y) E y 



xi,Z 1 



±v 



a{xi,y)f(z\y)v(dz) 



( f(Zi\x) 
\f(Zi\0o) 



which is bounded for every (w,z\) G S x Z by condition 
14.51 and for every xi,x,y G S by condition 14.41 As concerns 
condition A3, the process {(Xk, Zk), Mk} keN has to be 
strongly irreducible and contracting (see [26, definition 2] 
for the terminology) and this can be shown using arguments 
similar to those in [26, proof of proposition 4]. 

Since these three conditions are satisfied, it results that: 
i) there exists a neighbourhood / of the origin such that, 



E 



\MklL\\ I X x = Xi,Zi = 



for every p E I, 

z\ j converges to a function H(p) uniformly in 
(x\,zi) G § x Z [44, section 4 and theorem 4.3]; 



ii) H'(0) = Ai [44, proposition 3.8]. 
These two properties will be used to derive the convergence 
rate of the sequence {fc^ 1 In Afe(Zi : fc)}fc 6 N- To this end, define 
the function 



H(p) — lim sup — In E^ 

k^+00 k 



= plnA fc (Zi :fc ) 



= limsup-mE Hl [Ag(Zi, fc )]. 

k — >+oo K 

Given the uniform convergence in (xi,Zi) and recalling that 
||Mfc2r|| = Afe(Zfe), property (Q) implies that 

lim i ]nE Hl [A p k (Z 1:k )} = \nH(p), Vp G /, 

and then H(p) = \nH(p), \/p G /, so that, from property (EO), 
H'(Q) = Xi. Denote now with T> ^ the set {p G K : H(p) < 
+00}. Since, from condition 14.41 



E ffl [Ag(Z 



l:fej 



<E 



-Hi 



n 



, f(Z n \x 
max ; ; — - 

*es \f(Z n \0 o ) 



< 



EE E * 



( f(Zi\x) 

\f(Zi\e 




< +00, 



it follows that H(a) < +00 and, thus, the interior part 
of Dfi contains the point p = 0. This and the fact that 
H'(0) = Xi implies that {± In A k (Z 1: k)} keN converges to Ai 
exponentially in k [46, exercise 2.3.25], [47, theorem IV.1]Q 
The exponential convergence is obviously much stronger 
than the a.s. convergence granted by theorem [4.81 Indeed, the 
former implies that 



+00 

^rp({|fe- 1 lnA fc (Z 1:fc )-A 1 | >e}|iTi) < +00, 
fc=i 

Ve > and r > 0, which in turn implies that 
{k~ x In Afc(Zi:fe)} fc converges r-quickly to Ai for any 
r > [48, lemma 3]0 The latter is also called strong complete 
convergence in [21], [22]. Given the r-quick convergence for 
any r > 0, since Ai is finite and strictly positive from theorem 
14.81 the thesis follows from [20, corollary 1]. 

The case under Hq can be handled similarly considering 
{Zk,Mk} keN , where {Z fe } fceN i s now an i.i.d process. □ 
In order to prove propositions 14.131 and 14.141 the following 
lemma is first needed. 

lemma A.l: D (£ x£S c(as)/(-|a:) || f(-\6 )) and 
D(/(-|#o)|| 12xgS c ( x )f('\ x )) are convex function on 
the set {{c(x)} xeS G [0, 1] M : Exes c ( x ) = l j- 

Proof: It can be verified exploiting Jensen inequality and 
convexity of functions — lnw and v\nv, v G M + . □ 
Proof of proposition \4.13\ From the proof of theorem 
equations (1A.41 > and (1A.5I ). Ai can be also written as 



7 A sequence {Yfej^gM of random variables is said to converge exponen- 
tially to a constant A if, for any sufficiently small e > 0, there exists a constant 
C such that P(\Y k - A| > e) < e- fcC [47]. 

8 A sequence {lfe}fcgr* of random variables is said to converge r-quickly 
to a constant A, for some r > 0, if E[T e r ] < +00, for all e > 0, where 
T e = sup{fc G N : \Y k - A| > e} (sup{0} = 0) [49]. 
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Eft [D(E x£S /(-k)P({X 1 = x}\Z^)\\f(-\e ))\. On 
the other hand, lemma IA.1I and Jensen inequality allow to 
write 



D ( E f{'\x)P({X t = ^KZ-^o = z_ co:0 })||/(-|e )) < 

< P({Xl = 4|{Z-oo:0 = Z-co:0}) D (f(-\x)\\f(-\e )), 



for every realization z_oo : o of Z_oo : o, whereby 



]T P({Xl = 4|Z~co:0) d (f(-\x)\\f(-\e )) 



= ^E ffl [p({X 1 =i}|Z_ ooi0 )]D(/(.| a! )||/(.|flb)) = 

x£$ 

= ^(x)D(f(-\x)\\.f(-\9 )). 

cces 

The upper bound on |Ao| can be proved similarly. □ 
Proof of proposition \4.14\ From ©, setting c(x) = 1 
for some x € S, it follows that D (f(-\x)\\f(-\9 )) has the 
same value for every x E S: this, along with proposition 14. 131 
demonstrates the upper bound on Ai. As to the lower bound, 
exploiting lemma I A. 1 1 and Jensen inequality, it follows that 

D (Exes c ( x )f(-\ x )\\f(-\9o)) = 

>D(E, eS /(-WiTE^ 1 <WII/(-^o)) = 

= D (£^/(^)IIWo)L (A.7) 

\x£S I 

for every probability vector c on S, where {c^}^£ ! 2 , 
is the set of all of the possible permutations of c 
and c\ = c. From the demonstration of theorem 14.81 
equations jAAj and dA.5l ), Ai can be written also as 



thus, 



_D (Exes K\*)P({Xi = x}\Z_ oo:Q )\\f(.\0 Q )) 
exploiting jA.7) , it results that 



and 



Ai > Eh 1 



D(Exg 8 W-WII/OI^)) 



= D(E, 68 £/(>)II/(W)> 

and the lower bound is proved. The bounds on |Ao| can be 
proved similarly. □ 
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